Overview

Dataset Statistics

Number of Variables 7
Number of Rows 113662
Missing Cells 9017
Missing Cells (%) 1.1%
Duplicate Rows 3665
Duplicate Rows (%) 3.2%
Total Size in Memory 46.5 MB
Average Row Size in Memory 429.1 B

Variable Types

Categorical 6
Numerical 1

Variables

TRANS DATE

categorical

Distinct Count 801
Unique (%) 0.7%
Missing 1
Missing (%) 0.0%
Memory Size 8.1 MB

Length

Mean 10
Standard Deviation 0
Median 10
Minimum 10
Maximum 10

Sample

1st row 2015-10-20
2nd row 2015-10-09
3rd row 2015-10-06
4th row 2015-10-29
5th row 2015-10-02

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 227322
Decimal Number 909288

TRANS VAT DESC

categorical

Distinct Count 6
Unique (%) 0.0%
Missing 7998
Missing (%) 7.0%
Memory Size 6.8 MB

Length

Mean 2
Standard Deviation 0
Median 2
Minimum 2
Maximum 2

Sample

1st row VR
2nd row VR
3rd row VR
4th row VR
5th row VR

Letter

Count 211328
Lowercase Letter 0
Space Separator 0
Uppercase Letter 211328
Dash Punctuation 0
Decimal Number 0

ORIGINAL GROSS AMT

numerical

Distinct Count 27271
Unique (%) 24.0%
Missing 1
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1.7 MB
Mean 177.3717
Minimum -486980.09
Maximum 88918.85
Zeros 0
Zeros (%) 0.0%
Negatives 3754
Negatives (%) 3.3%

Quantile Statistics

Minimum -486980.09
5-th Percentile 2.5
Q1 12.95
Median 41.18
Q3 99.38
95-th Percentile 819.2
Maximum 88918.85
Range 575898.94
IQR 86.43

Descriptive Statistics

Mean 177.3717
Standard Deviation 2050.3119
Variance 4.2038e+06
Sum 2.016e+07
Skewness -118.8609
Kurtosis 29127.5831
Coefficient of Variation 11.5594

MERCHANT NAME

categorical

Distinct Count 10991
Unique (%) 9.7%
Missing 1
Missing (%) 0.0%
Memory Size 8.9 MB

Length

Mean 16.8575
Standard Deviation 4.307
Median 17
Minimum 2
Maximum 25

Sample

1st row shell kings 587
2nd row tesco pfs 6119
3rd row tex sussex sstn
4th row tex sussex sstn
5th row malthurst petroleu

Letter

Count 1647343
Lowercase Letter 1647343
Space Separator 174397
Uppercase Letter 0
Dash Punctuation 4289
Decimal Number 58886

TRANS CAC DESC 1

categorical

Distinct Count 136
Unique (%) 0.1%
Missing 79
Missing (%) 0.1%
Memory Size 8.7 MB

Length

Mean 15.131
Standard Deviation 4.0027
Median 15
Minimum 3
Maximum 28

Sample

1st row Vehicle Fuel
2nd row Vehicle Fuel
3rd row Vehicle Fuel
4th row Vehicle Fuel
5th row Vehicle Fuel

Letter

Count 1557359
Lowercase Letter 1293748
Space Separator 137860
Uppercase Letter 263611
Dash Punctuation 13
Decimal Number 396

TRANS CAC DESC 2

categorical

Distinct Count 1093
Unique (%) 1.0%
Missing 199
Missing (%) 0.2%
Memory Size 9.7 MB

Length

Mean 24.2894
Standard Deviation 8.696
Median 25
Minimum 4
Maximum 40

Sample

1st row African-Caribbean ...
2nd row Mobile Night Care ...
3rd row Shakti Elders Dce,...
4th row Shakti Elders Dce,...
5th row Enablement Tyburn ...

Letter

Count 2302699
Lowercase Letter 1835638
Space Separator 344889
Uppercase Letter 467061
Dash Punctuation 5621
Decimal Number 25482

Directorate

categorical

Distinct Count 17
Unique (%) 0.0%
Missing 738
Missing (%) 0.6%
Memory Size 8.4 MB

Length

Mean 13.1012
Standard Deviation 3.7552
Median 13
Minimum 1
Maximum 28

Sample

1st row Adult & Communitie...
2nd row Adult & Communitie...
3rd row Adult & Communitie...
4th row Adult & Communitie...
5th row Adult & Communitie...

Letter

Count 1295995
Lowercase Letter 540014
Space Separator 106863
Uppercase Letter 755981
Dash Punctuation 0
Decimal Number 0

Interactions

Correlations

Missing Values